Abstract: As the number of users increases, the amount of data increases as well. The phrase big data refers to a very large volume of data that is complex in nature because it includes both structured and unstructured data, which makes it difficult to analyze. Many different sources, such as social media postings, sensors that provide climate information, and other digital information, contribute huge amounts of data to big data. To extract useful and readable patterns from big data, it is necessary to use data mining techniques. A big data stream is a data stream characterized by high variety, veracity, and velocity. Processing such data is not easy, because it requires identifying, locating, and responding to the data; this makes the task challenging, and it has to be done in a well-automated manner.

Keywords: Hadoop, NoSQL, Pig, Spark, 3 Vs.